Exact Learning of Tree Patterns

Authors

  • Thomas R. Amoth
  • Dana Angluin
  • Lisa Hellerstein
  • Roni Khardon
  • Stephen Kwek
  • David Page
Abstract

Tree patterns are natural candidates for representing rules and hypotheses in many tasks such as information extraction and symbolic mathematics. A tree pattern is a tree with labeled nodes where some of the leaves may be labeled with variables, whereas a tree instance has no variables. A tree pattern matches an instance if there is a consistent substitution for the variables that allows a mapping of subtrees to matching subtrees of the instance. A finite union of tree patterns is called a forest. In this thesis, we study the learnability of tree patterns from queries when the subtrees are ordered or unordered. The learnability is determined by the semantics of matching as defined by the types of mappings from the pattern subtrees to the instance subtrees. Exact supervised learning is used. We first show that ordered tree patterns and forests, with an infinite label alphabet (or equivalent condition), are learnable from equivalence (and membership) queries. Ordered forests and similar classes are shown to be as hard to learn as DNF without an infinite label alphabet or equivalent. We next show that unordered tree patterns and forests are not exactly learnable from equivalence and subset queries when the mapping between subtrees is one-to-one onto, regardless of the computational power of the learner. Tree and forest patterns are learnable from equivalence and membership queries for the one-to-one into mapping. Finally, we connect the problem of learning tree patterns to inductive logic programming by describing a class of tree patterns called Clausal trees that includes nonrecursive single-predicate Horn clauses and show that this class is learnable from equivalence and membership queries.
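The matching notion described in the abstract can be illustrated with a minimal sketch for the ordered case. It rests on assumptions that are not taken from the thesis: children are matched position by position with equal arity, a variable leaf binds to a whole instance subtree, and repeated variables must bind to equal subtrees. The names `Tree`, `var`, and `match` are hypothetical helpers chosen for this illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Tree:
    """A labeled ordered tree; is_var marks a variable leaf in a pattern."""
    label: str
    children: Tuple["Tree", ...] = ()
    is_var: bool = False


def var(name: str) -> Tree:
    """Build a variable leaf (hypothetical helper for this sketch)."""
    return Tree(name, (), True)


def match(pattern: Tree, instance: Tree,
          subst: Optional[Dict[str, Tree]] = None) -> Optional[Dict[str, Tree]]:
    """Return a consistent substitution if pattern matches instance, else None.

    Assumed semantics: ordered children, equal arity, variables bind to
    whole subtrees, and every occurrence of a variable binds identically.
    """
    subst = {} if subst is None else subst
    if pattern.is_var:
        bound = subst.get(pattern.label)
        if bound is None:
            subst[pattern.label] = instance  # first occurrence: bind it
            return subst
        return subst if bound == instance else None  # must stay consistent
    if (pattern.label != instance.label
            or len(pattern.children) != len(instance.children)):
        return None
    for p_child, i_child in zip(pattern.children, instance.children):
        if match(p_child, i_child, subst) is None:
            return None
    return subst


# Pattern in the spirit of the integration illustration: Integrate(x ^ n, x),
# where "x" is a constant label and n is a variable.
pattern = Tree("Integrate", (Tree("^", (Tree("x"), var("n"))), Tree("x")))
cubic = Tree("Integrate", (Tree("^", (Tree("x"), Tree("3"))), Tree("x")))
other = Tree("Integrate", (Tree("^", (Tree("y"), Tree("3"))), Tree("x")))

print(match(pattern, cubic))
print(match(pattern, other))
```

Under these assumptions the first call yields a substitution binding `n` to the leaf `3`, and the second call fails because the constant label `x` does not match `y`. The unordered semantics studied in the thesis (one-to-one onto and one-to-one into mappings) would need a search over child assignments rather than the positional pairing used here.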
[Figure 0.1: Simple Integration Rule]

[Figure 0.2: Simple Learning Illustration (Integration/Ordered Trees), panels (a), (b), (c)]

Acknowledgments

This research was partially supported by the NSF under grant number IRI-9520243. We thank Dana Angluin, Lisa Hellerstein, Roni Khardon, Stephen Kwek, David Page, Vijay Raghavan, and Chandra Reddy for interesting discussions on the topic of this paper. We thank the reviewers for many excellent suggestions.

[Figure 0.3: Learning with Abstract Trees, panels (a) Examples and (b) Pattern]


Similar resources

Which is effective: self-directed learning or tutor-directed learning on the level of nursing skills

Introduction. This quasi-experimental study was conducted to determine and compare the level of nursing-skills learning among B.A. students under self-directed and tutor-directed learning at the Nursing and Midwifery faculty of Shaheed Beheshti University of Medical Sciences and Health Services, 1998-1999. Methods. First of all, a questionnaire composed of some demographic data such ...


Using an Adaptive Search Tree to Predict User Location

In this paper, we propose a method for predicting a user’s location based on their past movement patterns. There is no restriction on the length of past movement patterns when using this method to predict the current location. For this purpose, a modified search tree has been devised. The search tree is constructed in an effective manner while it additionally learns the movement patterns of a u...


The Role of the Tree of Life in Turkmen Carpets (with Emphasis on Tree Motifs in Islamic Culture and Ancient Civilizations)

In Islam the “Tree of Life” is called Sedreh or Tuba, and followers of Islam believe that this tree grows in Heaven; it is therefore an interesting subject for artistic innovation. In Turkmen terminology the “Tree of Life” is called “Yashaish bagh”. In this study we have made an effort to evaluate the symbol of the Tree and the “Tree of Life”,...


Complexity of Equivalence and Learning for Multiplicity Tree Automata

We consider the complexity of equivalence and learning for multiplicity tree automata, i.e., weighted tree automata over a field. We first show that the equivalence problem is logspace equivalent to polynomial identity testing, the complexity of which is a longstanding open problem. Secondly, we derive lower bounds on the number of queries needed to learn multiplicity tree automata in Angluin’s...


MMDT: Multi-Objective Memetic Rule Learning from Decision Tree

In this article, a Multi-Objective Memetic Algorithm (MA) for rule learning is proposed. Prediction accuracy and interpretation are two measures that conflict with each other. In this approach, we consider both the accuracy and the interpretation of rule sets. Additionally, individual classifiers face other problems such as huge size, high dimensionality, and imbalanced class distributions in data sets. This...




Publication date: 2000